[GSoC] Blockwise Quantization Tool #265
Merged
This PR introduces a Python tool for block quantizing ONNX models. The quantized models adhere to the ONNX standard, verified using `onnx.checker.check_model(self.model, full_check=True)`. Additionally, these block-quantized models are compatible with the `QuantizeLinear` and `DequantizeLinear` layers in OpenCV, introduced in opencv/opencv#25644, allowing them to be executed within the OpenCV DNN engine.

The tool currently performs asymmetric weight-only quantization on convolutional layers, and the desired quantization block size can be specified. Convolutional weights are first flattened, $[C_{out}, C_{in}, K_w, K_h] \rightarrow [C_{out}, C_{in} \times K_w \times K_h]$, and quantization is then applied along axis 1.
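As a rough illustration of this scheme (a minimal sketch, not the tool's actual implementation; the uint8 range, the padding strategy, and the function name are assumptions), per-block asymmetric quantization of a convolutional weight tensor could look like this:

```python
import numpy as np

def blockwise_quantize_conv_weight(w: np.ndarray, block_size: int = 16):
    """Sketch of asymmetric (uint8) blockwise quantization along axis 1."""
    c_out = w.shape[0]
    # Flatten [C_out, C_in, K_h, K_w] -> [C_out, C_in * K_h * K_w].
    flat = w.reshape(c_out, -1).astype(np.float32)
    # Pad axis 1 so it splits evenly into blocks of `block_size`.
    pad = (-flat.shape[1]) % block_size
    blocks = np.pad(flat, ((0, 0), (0, pad))).reshape(c_out, -1, block_size)

    # One scale and zero point per block (asymmetric / affine quantization).
    w_min = blocks.min(axis=-1, keepdims=True)
    w_max = blocks.max(axis=-1, keepdims=True)
    scale = np.where(w_max > w_min, (w_max - w_min) / 255.0, 1.0)
    zero_point = np.clip(np.round(-w_min / scale), 0, 255)

    q = np.clip(np.round(blocks / scale) + zero_point, 0, 255).astype(np.uint8)
    return q, scale.squeeze(-1), zero_point.squeeze(-1).astype(np.uint8)
```

At inference time, the corresponding `DequantizeLinear` node reconstructs each block as `(q - zero_point) * scale`.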
Future enhancements could extend the tool's capabilities, making it more customizable and general.
The tool also provides a quantization summary, reporting the overall quantization mean squared error as well as the initial and final model sizes.
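For illustration only, a minimal sketch of how such a summary could be assembled (the helper name and its arguments are hypothetical, not the tool's actual API):

```python
import os
import numpy as np

def print_quantization_summary(original_path, quantized_path, original_w, dequantized_w):
    """Hypothetical helper: report overall quantization MSE and model sizes."""
    # original_w / dequantized_w: lists of the float weights before and after
    # a quantize/dequantize round trip.
    orig = np.concatenate([w.ravel() for w in original_w])
    deq = np.concatenate([w.ravel() for w in dequantized_w])
    mse = float(np.mean((orig - deq) ** 2))
    print(f"Quantization MSE : {mse:.6f}")
    print(f"Model size       : {os.path.getsize(original_path) / 2**20:.2f} MiB "
          f"-> {os.path.getsize(quantized_path) / 2**20:.2f} MiB")
```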
Testing
When employing a block size of 16 and normalized input images, the mean squared quantization error was found to be on the order of $10^{-2}$ to $10^{-3}$.
Furthermore, a qualitative assessment was conducted by blockwise-quantizing several models from this repository, executing them, and comparing the results with those of the original and int8 models.
The findings indicate that, with an even block size, the block-quantized model maintains performance levels equivalent to those of the original model while achieving a reduction in model size.
Here is an example applied to the YuNet face detection model:
Loading the following GIF may take some time because of its size.
Onnxruntime and DNN comparison
The resulting models have been tested using both `onnxruntime` and OpenCV DNN, both of which produced identical outputs for the same input data. Since onnxruntime introduced support for blockwise quantization inference only recently, and this functionality is not yet included in the latest release, the only way to test it is to build onnxruntime from source.
Then install the resulting wheel with pip.
To test the resulting networks with OpenCV DNN, you need to build OpenCV with the changes from pull request opencv/opencv#25644.
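The comparison between the two backends can be sketched as follows (a minimal example under assumptions: the model path, input shape, and single-output model are placeholders, not taken from this PR):

```python
import cv2
import numpy as np
import onnxruntime as ort

# Hypothetical block-quantized model and input; adjust to the actual model.
model_path = "yunet_block_quantized.onnx"
inp = np.random.rand(1, 3, 160, 120).astype(np.float32)

# Run with onnxruntime (built from source for blockwise DequantizeLinear support).
sess = ort.InferenceSession(model_path)
ort_out = sess.run(None, {sess.get_inputs()[0].name: inp})[0]

# Run with OpenCV DNN (built with the changes from opencv/opencv#25644).
net = cv2.dnn.readNetFromONNX(model_path)
net.setInput(inp)
dnn_out = net.forward()

# The two backends are expected to produce (nearly) identical outputs.
print("max abs difference:", np.max(np.abs(ort_out - dnn_out)))
```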